Modify pipeline for whole transcriptome using HUGO names by gadenbuie · Pull Request #4 · GerkeLab/PAM50

gadenbuie · 2018-07-30T17:31:17Z

I modified the PAM50 pipeline to use HUGO names instead, pulling probeset-to-gene mappings from the latest HuEx-1_0-st-v2 Probeset Annotations (released 7/16/16). The annotation file sits behind a login; it was downloaded from http://www.affymetrix.com/Auth/analysis/downloads/na36/wtexon/HuEx-1_0-st-v2.na36.hg19.probeset.csv.zip

The HUGO name lookup uses the gene symbols curated at genenames.org, downloaded from http://beta.genenames.org/download/custom.

The SUMMARIZE_FUNCTION option variable sets the function that summarizes gene expression when multiple probesets map to the same gene name. You can pass a summary function from base R here (e.g. median, mean, quantile), or you can write your own function, as long as it takes a vector as its first argument and returns a single value.
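
As a rough illustration of what a custom summary function could look like, here is a minimal sketch on made-up data. The helper name `upper_quartile`, the toy data frame, and the use of `aggregate()` are assumptions for illustration only; the scripts in this PR may apply SUMMARIZE_FUNCTION differently.

```r
# Hypothetical custom summary: takes a numeric vector as its first argument
# and returns a single value (the 75th percentile).
upper_quartile <- function(x, ...) {
  unname(quantile(x, probs = 0.75, na.rm = TRUE))
}

SUMMARIZE_FUNCTION <- upper_quartile

# Toy data (made up): three probesets map to GENE_A, one to GENE_B.
probe_exprs <- data.frame(
  gene     = c("GENE_A", "GENE_A", "GENE_A", "GENE_B"),
  sample_1 = c(1, 2, 6, 4),
  sample_2 = c(3, 3, 9, 5)
)

# Collapse probesets to one row per gene, applying the summary function
# to each sample column within each gene.
gene_exprs <- aggregate(
  probe_exprs[, c("sample_1", "sample_2")],
  by  = list(gene = probe_exprs$gene),
  FUN = SUMMARIZE_FUNCTION
)
```

Note that a bare `quantile` returns several values by default, so it would likely need a wrapper like the one above to satisfy the single-value requirement.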

The scripts gather the GSE46691 files into the data subfolder. If you have previously downloaded these elsewhere, copy or link them into this folder and the scripts will skip downloading.
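
The skip-if-present behavior can be sketched roughly as follows. This is not the code from the PR: the function name `download_if_missing` and the choice to name the local file after the URL's basename are assumptions for illustration.

```r
# Hypothetical helper: download a file into dest_dir unless a file with the
# same name is already there (for example, copied or symlinked in by hand).
download_if_missing <- function(url, dest_dir = "data") {
  dir.create(dest_dir, showWarnings = FALSE, recursive = TRUE)
  dest <- file.path(dest_dir, basename(url))
  if (file.exists(dest)) {
    message("Found ", dest, "; skipping download")
  } else {
    download.file(url, dest, mode = "wb")
  }
  invisible(dest)
}
```

Because the check is only on the file name, a symlink into the data folder would count as already downloaded under this sketch.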

Finally, I realized that the previous script relies on biogroom, which is still a private repo, so I imported the functions needed for the final step of extracting and cleaning the phenotype data.

Modify pipeline for whole transcriptome using HUGO names

c6536af

gadenbuie requested a review from tgerke July 30, 2018 17:31

gadenbuie added 4 commits July 30, 2018 13:48

no need to duplicate checkpoint of exprs data

bb1a743

Minor typo (extra 6 in gse accession number)

d60e713

Update matching on hugo name

2ad0eba

Fix typos in refactoring

aa1e03e

gadenbuie wants to merge 5 commits into master from mayo-545
